

Reviews: Bootstrapping Upper Confidence Bound

Neural Information Processing Systems

It should be acknowledged that it is significantly more complex than, for example, UCB1. Indeed, at each time step B bootstrap repetitions are needed to estimate the bootstrapped quantiles, and each of them requires drawing n_k random variables for each arm k (the values of the w's). This also requires storing all past rewards obtained on all arms, which takes a lot of memory. The same constraint applies to the empirical KL-UCB mentioned above, which is one more reason to compare the two algorithms, as they have similar complexity. From Theorem 2, I guess that the w's are Rademacher random variables, but it would be good to specify this in the statement of the algorithm.
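To make the reviewer's description concrete, here is a minimal sketch of a multiplier-bootstrap upper confidence index with Rademacher weights, as the reviewer infers from Theorem 2. This is not the authors' implementation; the function name and the defaults for B (bootstrap repetitions) and alpha (quantile level) are illustrative assumptions.

```python
import numpy as np

def bootstrap_ucb_index(rewards, B=200, alpha=0.05, rng=None):
    """Multiplier-bootstrap upper confidence index for a single arm.

    For each of B repetitions, draw Rademacher multipliers w_1..w_n
    (+1 or -1 with probability 1/2) and record the perturbed quantity
    sum_i w_i * (x_i - mean) / n; the (1 - alpha) empirical quantile of
    these perturbations is added to the empirical mean as the bonus.
    """
    rng = np.random.default_rng(rng)
    x = np.asarray(rewards, dtype=float)
    n = x.size
    mean = x.mean()
    # B x n matrix of Rademacher weights, one row per bootstrap repetition
    w = rng.choice([-1.0, 1.0], size=(B, n))
    perturbations = (w * (x - mean)).sum(axis=1) / n
    return mean + np.quantile(perturbations, 1.0 - alpha)
```

Note that the bonus is data-dependent: for an arm with constant observed rewards the centered terms vanish and the index collapses to the empirical mean, whereas a concentration-based bonus would stay strictly positive. This sketch also makes the reviewer's complexity point visible: each call draws B * n_k multipliers and needs all n_k stored rewards for the arm.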


Reviews: Bootstrapping Upper Confidence Bound

Neural Information Processing Systems

The reviewers updated their scores after the rebuttal and discussion. Congratulations on a nice paper that reached a consensus on acceptance! The reviewers had a couple of outstanding concerns (such as relating B and T) that I would like the authors to explicitly discuss (including potentially mentioning open problems) in the camera-ready version.



Bootstrapping Upper Confidence Bound

Hao, Botao, Yadkori, Yasin Abbasi, Wen, Zheng, Cheng, Guang

Neural Information Processing Systems

The Upper Confidence Bound (UCB) method is arguably the most celebrated approach to online decision making with partial-information feedback. Existing techniques for constructing confidence bounds are typically built upon various concentration inequalities and thus lead to over-exploration. In this paper, we propose a non-parametric, data-dependent UCB algorithm based on the multiplier bootstrap. To improve its finite-sample performance, we further incorporate a second-order correction into the above construction. In theory, we derive both problem-dependent and problem-independent regret bounds for multi-armed bandits under a much weaker tail assumption than the standard sub-Gaussianity.
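For contrast with the concentration-based construction the abstract criticizes, here is a minimal sketch of the classical UCB1 index built from a Hoeffding-style inequality (the helper name is ours). Its exploration bonus depends only on the pull count and the time step, never on the observed reward spread, which is the source of the over-exploration a data-dependent bound avoids.

```python
import math

def ucb1_index(rewards, t):
    """Classical UCB1 index for one arm at time step t.

    The bonus sqrt(2 ln t / n) comes from Hoeffding's inequality for
    rewards in [0, 1]; it shrinks only as the arm is pulled more often,
    regardless of how concentrated the observed rewards actually are.
    """
    n = len(rewards)
    mean = sum(rewards) / n
    return mean + math.sqrt(2.0 * math.log(t) / n)
```

Even an arm whose rewards are all identical keeps a strictly positive bonus here, whereas a bootstrap-based index can adapt its width to the empirical dispersion of the data.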